Picture for Mengdi Wang

Mengdi Wang

RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System

Add code
Feb 02, 2026
Viaarxiv icon

Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts

Add code
Feb 02, 2026
Viaarxiv icon

Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents

Add code
Feb 02, 2026
Viaarxiv icon

FutureX-Pro: Extending Future Prediction to High-Value Vertical Domains

Add code
Jan 18, 2026
Viaarxiv icon

Deep Delta Learning

Add code
Jan 01, 2026
Viaarxiv icon

CubeBench: Diagnosing Interactive, Long-Horizon Spatial Reasoning Under Partial Observations

Add code
Dec 30, 2025
Viaarxiv icon

Web World Models

Add code
Dec 29, 2025
Viaarxiv icon

Monadic Context Engineering

Add code
Dec 27, 2025
Viaarxiv icon

GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators

Add code
Dec 23, 2025
Viaarxiv icon

From Word to World: Can Large Language Models be Implicit Text-based World Models?

Add code
Dec 21, 2025
Viaarxiv icon